Localized Methods for Protein Interaction Prediction
نویسنده
چکیده
Identification of protein-protein interactions is important for drug design and the treatment of diseases. We propose a novel threading algorithm, LTHREADER, which generates accurate local sequence-structure alignments and integrates various statistical scores and experimental binding data to predict interactions. LTHREADER uses a profile of secondary structure and solvent accessibility predictions with residue contact maps to guide and constrain alignments. Using a decision tree classifier and low-throughput experimental data for training, it combines information inferred from statistical interaction potentials, energy functions, correlated mutations and conserved residue pairs to predict likely interactions. The significance of predicted interactions is evaluated using the scores for randomized binding surfaces within each family. We first apply our method to cytokines, which play a central role in the development of many diseases including cancer and inflammatory and autoimmune disorders. We tested our approach on two representative families from different structural classes (all-alpha and all-beta proteins) of cytokines. In comparison with the state-of-the-art threader RAPTOR, LTHREADER generates on average 20% more accurate alignments of interacting residues and shows dramatic improvement in prediction accuracy over existing methods. To further improve alignment accuracy for all PPI families, we also introduce the program CMAPi, a twodimensional dynamic programming algorithm that, given a pair of protein complexes, optimally aligns the contact maps of their interfaces. We demonstrate the efficacy of our algorithm on complexes from PPI families listed in the SCOPPI database and from highly divergent cytokine families. In comparison to existing techniques, CMAPi generates more accurate alignments of interacting residues within families of interacting proteins, especially for sequences with low similarity. Thesis Supervisor: Bonnie Berger Title: Professor of Applied Mathematics Acknowledgements First and foremost, I would like to express my deepest gratitude to Bonnie Berger. I consider myself extremely fortunate to have her as my advisor. Her advice and support, both academic and personal, have helped me deal some very difficult times during this decade long journey that began when I worked with her as a UROP student. Her neverending enthusiasm and belief not just in me, but all of her students is something that I will always cherish. I would also especially like to thank Jadwiga Bienkowska, who first inspired the research described in this thesis. I could not have asked for a more knowledgeable and enthusiastic collaborator. Her deep understanding of the field and creative problem solving ability helped us deal with many difficult challenges along the way. I also appreciate the amount of time and effort she put into this research even while working fulltime in industry. I want to thank Jinbo Xu and Rohit Singh for many helpful discussions along the way. Their vast knowledge and experience is far superior to mine. Also, I want to thanks all the members of the computational biology group for their thoughtful feedback during group meetings. Finally, above all, I want to thank my family for their constant love and support throughout my life. I owe everything to my parents who sacrificed so much to give me and my brother the opportunity to be where we are today. This thesis is as much for them as it is for me. And although my father is no longer here, I know he is sharing in my joy. He has been my constant source of inspiration throughout. This thesis was supported by Grant Number 1R01GM081871-01Al from NIH. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of NIH. Table of
منابع مشابه
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...
متن کاملDiscovering Domains Mediating Protein Interactions
Background: Protein-protein interactions do not provide any direct information regarding the domains within the proteins that mediate the interactions. The majority of proteins are multi domain proteins and the interaction between them is often defined by the pairs of their domains. Most of the former studies focus only on interacting domain pairs. However they do not consider the in...
متن کاملPrediction of Coffee Effects in Rats with Healthy and NAFLD Conditions Based on Protein-Protein Interaction Network Analysis
Background and objectives: Non-alcoholic fatty liver disease (NAFLD) is a common liver condition. On the other hand, coffee consumption has shown promising for gastrointestinal diseases. Detection of the most valuable biomarkers of decaffeinated coffee treatment in healthy and non-alcoholic fatty liver disease conditions was the aim of the present study. Methods:</stro...
متن کاملStudy of PKA binding sites in cAMP-signaling pathway using structural protein-protein interaction networks
Backgroud: Protein-protein interaction, plays a key role in signal transduction in signaling pathways. Different approaches are used for prediction of these interactions including experimental and computational approaches. In conventional node-edge protein-protein interaction networks, we can only see which proteins interact but ‘structural networks’ show us how these proteins inter...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملGenome-wide Protein-chemical Interaction Prediction
The analysis of protein-chemical reactions on a large scale is critical to understanding the complex interrelated mechanisms that govern biological life at the cellular level. Chemical proteomics is a new research area aimed at genome-wide screening of such chemical-protein interactions. Traditional approaches to such screening involve in vivo or in vitro experimentation, which while becoming f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009